Disentangled Multimodal Representation Learning for Recommendation
نویسندگان
چکیده
Many multimodal recommender systems have been proposed to exploit the rich side information associated with users or items (e.g., user reviews and item images) for learning better representations improve recommendation performance. Studies from psychology show that individual differences in utilization of various modalities organizing information. Therefore, a certain factor an (such as appearance xmlns:xlink="http://www.w3.org/1999/xlink">quality ), features different are varying importance user. However, existing methods ignore fact contribute differently towards user's preference on factors item. In light this, this paper, we propose novel xmlns:xlink="http://www.w3.org/1999/xlink">Disentangled Multimodal Representation Learning (DMRL) model, which can capture users' attention each modeling. particular, employ disentangled representation technique ensure modality independent other. A mechanism is then designed factor. Based estimated weights obtained by mechanism, make recommendations combining scores preferences target over modalities. Extensive evaluation five real-world datasets demonstrate superiority our method compared methods.
منابع مشابه
Linear Disentangled Representation Learning for Facial Actions
Limited annotated data available for the recognition of facial expression and action units embarrasses the training of deep networks, which can learn disentangled invariant features. However, a linear model with just several parameters normally is not demanding in terms of training data. In this paper, we propose an elegant linear model to untangle confounding factors in challenging realistic m...
متن کاملLearning Spatiotemporal-Aware Representation for POI Recommendation
The wide spread of location-based social networks brings about a huge volume of user check-in data, which facilitates the recommendation of points of interest (POIs). Recent advances on distributed representation shed light on learning low dimensional dense vectors to alleviate the data sparsity problem. Current studies on representation learning for POI recommendation embed both users and POIs...
متن کاملRepresentation Learning for cold-start recommendation
A standard approach to Collaborative Filtering (CF), i.e. prediction of user ratings on items, relies on Matrix Factorization techniques. Representations for both users and items are computed from the observed ratings and used for prediction. Unfortunatly, these transductive approaches cannot handle the case of new users arriving in the system, with no known rating, a problem known as user cold...
متن کاملDetach and Adapt: Learning Cross-Domain Disentangled Deep Representation
The recent progress and development of deep generative models have led to remarkable improvements in research topics in computer vision and machine learning. In this paper, we address the task of cross-domain feature disentanglement. We advance the idea of unsupervised domain adaptation and propose to perform joint feature disentanglement and adaptation. Based on generative adversarial networks...
متن کاملDomain Adaptation Meets Disentangled Representation Learning and Style Transfer
In order to solve unsupervised domain adaptation problem, recent methods focus on the use of adversarial learning to learn the common representation among domains. Although many designs are proposed, they seem to ignore the negative influence of domain-specific characteristics in transferring process. Besides, they also tend to obliterate these characteristics when extracted, although they are ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2022
ISSN: ['1520-9210', '1941-0077']
DOI: https://doi.org/10.1109/tmm.2022.3217449